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Abstract. Falmagne recently introduced the concept of a medium, a combinatorial object encompassing hyperplane 
arrangements, topological orderings, acyclic orientations, and many other familiar structures. We find efficient 
solutions for several algorithmic problems on media: finding short reset sequences, shortest paths, testing whether 
a medium has a closed orientation, and listing the states of a medium given a black-box description. 

1 Introduction 

Motivated by political choice theory, Falmagne JlO| ] (see also Falmagne and Ovchinnikov [11]) recently 
introduced the concept of a medium, a combinatorial object that also encompasses hyperplane arrangements, 
topological orderings, acyclic orientations, and many other familiar combinatorial structures. 

Formally, a medium consists of a set of states transformed by the actions of a set of tokens, satisfying 
certain axioms. That is, it is essentially a restricted type of deterministic finite automaton, without distin- 
guished initial and final states. Tokens may be concatenated to form messages (words, in finite automaton 
terminology). We use upper case letters to denote states, and lower case letters to denote tokens and mes- 
sages; Sw denotes the state formed by applying the tokens in message w to state S. A token t is said to have 
a reverse t if, for any two states S ^ Q, St = Q iff Qt = S. A message is said to be inconsistent if it contains 
some token and its reverse, and consistent otherwise. A message w is said to be vacuous if, for each token t, 
w contains equal numbers of copies of t and t. A token t is said to be effective for S if St / S, and a message 
w is be stepwise effective for S if each successive token in the sequence of transformations of Sw is effective. 
A medium is then defined to be a system of states and tokens satisfying the following axioms: 

1. Each token has a unique reverse. 

2. For any two distinct states S, Q, there exists a consistent message w with Sw = Q. 

3. If message w is stepwise effective for S, then Sw = S if and only if w is vacuous. 

4. If Sw = Qz, w is stepwise effective for S, z is stepwise effective for Q, and both w and z are consistent, 
then wz, is consistent. 

The following are examples of media: 

Permutations. The set of permutations of n items form the states of a medium, with a token txy for each 
ordered pair xy of items that replaces an adjacent pair yx in a permutation by the pair xy, or leaves the 
permutation unchanged if no such pair exists. The reverse of t xy is t yx . 

Topological orderings. For any directed acyclic graph G, one can define a medium with states that are the 
topological orderings of G, and the same swap operations txy as above for each pair of unrelated vertices 
in the graph. When G has no edges, we get the permutation medium on the vertices of G. 

Acyclic orientations. Let G be an undirected graph, define a state to be an acyclic orientation of the edges 
of G, and for any ordered pair xy of adjacent vertices define a token txy that reorients edge (x, y) from x 
to y, if the resulting orientation is acyclic, and leaves the orientation unchanged otherwise. The result is 
a medium. When G is complete, it reduces to the permutation medium on the vertices of G. 



Hyperplane arrangements. Let C be a convex region in M. d , and A be a hyperplane arrangement in C. 
Then the convex cells of A form the states of a medium, with one token ?/, for each halfspace h bounded 
by a hyperplane in A. If a cell S is included in C \ h and shares a facet with a cell S' included in h, then 
Stfr = S'; otherwise St^ = S. In the special case where C = W 1 , with Cartesian coordinates x,-, and A is 
the arrangement of hyperplanes x, = xj for the edges (ij) of an n- vertex graph, this is isomorphic to the 
acyclic orientation medium described above. We can also realize the medium of topological orderings 
by using hyperplanes x\ = xj for all i and j, with C consisting of the points in R d satisfying x, < xj for 
each arc (£,_/) in the given DAG. 

Well-graded families of sets. A family W of subsets of a set X = UW is well-graded [Q] if, for any two 
distinct sets S and Q in W, there exists a sequence 5 = Sq, Si , . . . , S# = Q in W such that \Si i ASj \ = 1 
for 1 < j < k = |5A<2|- (Here, A denotes the set-theoretic symmetric difference.) Falmagne and 



Ovchinnikov [ ]11[ ] show that such a family can be cast as a medium, the states of which are the sets 
in W. To each x in X are associated two tokens i x and d x , defined respectively by: Si x = S U {x} if 
S U {x} G W, and Si x = S otherwise; and Sd x = S \ {x} if S \ {x} € W, and Sd x = S otherwise. 
It is easily verified that the set of states W and the collection of tokens i x and d x satisfy the axioms of 
a medium. Notice that for each x in X, the tokens i x and d x are mutual reverses. As suggested by the 
remaining examples in this section, this structure subsumes many special cases. 

Downward closed set families. Let T denote a family of subsets of a set X = UJ 7 having the property that 
T contains any subset of any of its members. For instance, the independent sets in a graph or matroid 
define such a family. Then T is well-graded and so can be represented as a medium. 

Binary trees. One can encode a binary tree as a set of integers, that give the heap numbers of nodes in the 
tree: the root has number 0, and the left and right children of a node with number i have numbers 2i + 1 
and IX + 2. Let T be the family of sets encoding binary trees with at most k leaves; then T is well-graded 
and can be represented as a medium. We can similarly represent the trees with height at most k. 

Antimatroids and convex geometries. As noted by Doignon and Falmagne well graded families closed 
under union are dual to antimatroids or convex geometries in the sense of Jamison and Edelman [|l] (see 



also Q15||). Such structures can thus also be represented as media. 



Notice that the media constructed in the last four examples are endowed with a natural 'orientation' in 
that the tokens are either adding or removing some element. Orientations of media were examined in JTT| ] 
and we use them also in this paper; in some sense oriented media are equivalent to well-graded set families 
(Lemma The permutation and arrangement examples are given by Falmagne and Ovchinnikov [11] 



who also provide several additional examples of media, including families of partial orders, interval orders, 
semiorders, and biorders. Previous work [[R], 11] has studied the combinatorics and applications of media. 



In this paper, we provide a first study of algorithms for media. 
2 New Results 

In the study of deterministic finite automata, an important concept that arises is that of a reset sequence [|1 



17]: an input word w such that applying this word to a state S yields a transformed result Sw that is indepen- 
dent of S. Thus, if we imagine an ensemble of copies of the automaton, each initially running in a distinct 
initial state, the application of this word will cause the ensemble to be synchronized to a common state. Re- 



set sequences have been applied, for example, in the design of devices for orienting machine parts [|18fl. If a 
reset sequence exists for an «-state automaton, there is one with C(n 3 ) symbols, which can be constructed in 
polynomial time; Cerny [Q] conjectured a tight bound of (n — l) 2 on the length of the shortest reset sequence 
for any automaton, but this is only known for special classes of automata [^,^[^TJ]. Our first result is a tight 
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bound of n — 1 on the length of the shortest reset sequence for a medium. We also describe an algorithm for 
finding a sequence with this length in time linear in the length of a simple description of the medium. 

Second, we study medium-theoretic concepts of distance. We define length functions on tokens and use 
them to define a natural concept of shortest paths in a medium. We show how to find single source shortest 
paths in linear time, and all pairs shortest paths in time quadratic in the number of states. We define a notion 
of the complement of a state in a medium, generalizing set-theoretic complementation in well-graded set 
families, and show how to find complements more quickly than all pairs shortest paths. 

Third, we consider the concept of a closed orientation, introduced and studied by Falmagne and Ovchin- 
nikov [11]. We describe a near-linear time for testing whether an orientation is closed, and a polynomial time 
algorithm for finding a closed orientation if one exists. 

Finally, we study black box oracle-based definitions of media, similar to the black box groups intro- 
duced by Babai and Szemeredi [J3j] and since studied extensively in computational group theory. We use a 
modified version of the reverse search procedure of Avis and Fukuda [Q] to show that the states of a black 
box medium can be listed in time and space per state that is polynomial in the number of tokens of the 
medium, generalizing previously known algorithms for listing the states of the example media described in 
the introduction. 



3 Preliminaries 

In order to perform algorithms analysis with media, we need some conventions for input format and size 
parameters. For any medium, we let n denote the number of states, and r denote the number of tokens. 
One simple format for describing a medium in a computer would be as an n x r matrix the cells of which 
contain the results of applying each token to each state. However, for many media, an even more concise 
representation is possible: define an adjacency list representation of a medium to be an ^-dimensional array, 
where, for any state S, array cell A [S] contains a list of the pairs (?, Q) for which St = Q ^ S. That is, 
we list only the effective transitions of the medium. We let m denote the number of pairs listed in all cells 
of this array. As an example for which m is much smaller than nr, consider a one-dimensional hyperplane 
arrangement formed by n — 1 points on a line: nr = 2n(n — 1) while m = 2(n — 1). 



We now recall some definitions from Falmagne and Ovchinnikov [ 10, 1 1 ]. A consistent message w which 
is stepwise effective for state S is called a straight path from S to Sw; we say more briefly that Sw = Q is a 
straight path. Axiom 2 in the definition of a medium can be strengthened to the statement that for any S and 
Q there is a straight path Sw = Q. For any state Q in a medium, the content of Q (Q, for short) is defined to 
be the set of tokens that can occur in messages w that determine straight paths Sw = Q. 



Lemma 1 (Falmagne [10]). If Sw = Q is a straight path, then Q\S consists of exactly the tokens in w. 



Lemma 2 (Falmagne [ 10]). For any token t and any state Q, exactly one of the two tokens t, t belongs to Q. 



Denote the set of tokens of a medium by T. Falmagne and Ovchinnikov [|ll|] define an orientation of 
a medium to be a partition {T + , T~} of T into positive and negative subsets such that, for each token t, 
exactly one of t and t is positive. By Lemma|, the partition {Q, T\Q} forms an orientation, which we call 
the content orientation of Q. For any orientation {T + , T~ } we call set Q + = Q PI T + the positive content 
of Q. As we mentioned in the introduction, a well-graded family of sets T provides a set-theoretic definition 
of a medium (<Sjf, Tp) equipped with a natural orientation {T^ = {i x \ x G UJ 7 }, = {d x \ x € UJT}}. 
Indeed, we have the following equivalence between well-graded set families and oriented media, for which 
we omit the proof: 
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Lemma 3. For any orientation T~} of a medium (<S, T), the family T of positive contents of states 
in S is well-graded, and (S,T) is isomorphic to (5jf,TJf); that is, there exist two bijections f : S t— > S?r 
and g : T i— > 7jr such that 

St = Q f(S)g(t)=f(Q), (S,QeF,teT). 

Moreover the orientation {T + ,T~} matches the natural orientation {T^, T^} of the medium for T, in that 
t E T + if and only if g(t) £ T^. 



Corollary 1. Any medium with r tokens has at most 2 r / 2 states. 

As a consequence of this, a factor of log n in the running time of an algorithm on a medium is always 
preferable to a factor of r. Even more preferable would be a factor of m/n, due to the following lemma: 



Lemma 4. In any medium, m < n log 2 n. 



Proof. Since media can be defined by well-graded set families, we prove more generally that, in any family 
T of n sets, the number of unordered pairs of sets (P, Q) with |PA<2| = 1 is at most inlog 2 n; each such 
pair contributes two effective transitions to m. This bound appears to be well known (compare Lemma 3 of 



Matousek Jig]) but for completeness we prove it here. 

As a base case for our induction on n, a family of one set has no such pairs. Choose x € UJ 7 , belonging 
to some but not all sets in T, and divide T into two subfamilies T x and T x where T x consists of the members 
of T that contain x and T x consists of the remaining members of T\ let \T X \ = a and \T X \ = b. Then each 
set P in T is involved in at most one pair (P, PA{x}) with one set in each subfamily, so the number of such 
pairs is at most min(a, ft). If a pair (P, Q) is not of this form, then both P and Q belong to the same subfamily. 
So, if M{n) denotes the maximum number of pairs defined by any n-set family, we have a recurrence 

M(n) < max M(a) +M{b) +min(a,b) 

a+b=n 

which solves to the desired bound. □ 



4 Reset Sequences 

Recall that a reset sequence for a medium is a message w such that Sw = Qw for every two states S and Q. 
We now describe an algorithm for efficiently finding short reset sequences. 

Given an oriented medium, one can construct a directed graph G the vertices of which correspond to 
the states of the medium and the arcs of which represent transitions: draw an arc from S to Q whenever 
St = Q ^ S for some positive token t. Axiom 3 of the definition of a medium is easily seen to imply that G 
is acyclic. 

Lemma 5. The graph G constructed from the content orientation of state Q has Q as its unique sink. 

Proof. If Qt = S, St = Q, so t is a member of Q and the edge in G connecting S with Q is oriented as an arc 
from S to <2; thus Q is a sink. Any other state S has a straight path Sw = Q, each step of which corresponds 
to an arc oriented from S towards Q, so S can not be a sink. □ 

Theorem 1. If we are given as input the adjacency list representation of a medium, then in time 0{m) we 
can find a reset sequence wfor the medium with length \w\ <n — \, such that for any state S of the medium, 
Sw = Q. 
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Proof. Construct the graph G as in Lemma |5f it has n vertices and m/2 arcs. In time linear in the size of G, 
find a topological ordering Sq, 5j , . . . , S n _i = Q of the graph. Then output w = Ps PSi ■ ■ ■ Ps„_ 2 ■ 

To prove that w is a reset sequence, we show by induction that, if w\ denotes the /-symbol prefix of w, 
and S is any state of the medium, then Sw,- = 5; for some j > i. As a base case, if i = 0, the statement is 
vacuous. Otherwise, by induction, Swj_i = S;t for some/ > i — 1. Note that w; = w,„i? for the positive 
token t = PSj_ r so Swj = Syt. Since t is positive, Syt = Sj for j >/, with equality only if t is ineffective for 
/. But t was chosen to be effective for 5 ( -_i, soj > i. □ 

We remark that better bounds on the reset sequence length, in terms of r instead of n, may be possible: 
for instance, one can form areset sequence with length (r/2) 2 simply by concatenating r/2 copies of a word 
w containing all positive tokens for the content orientation of a state Q. However for some media (r/2) 2 
may be larger than n — 1. Since t/2 < n — 1, it is natural to hope that a reset sequence of length r/2 always 
exists, but this is not true; for instance there is no such reset sequence for a medium with six states and six 
tokens, formed by the well-graded family of all 1- and 2-element subsets of a 3-element set. 

5 Distances and Complements 

It is natural to define distances in a medium by assigning lengths to the medium's tokens. We define a length 
function for a medium (<S, T) to be a function A : T i— > R, satisfying the constraint that X(t) + A (7) > for 
every token t. We define the length of a path 5w = Q to be the sum of the lengths of the effective tokens on 
the path, and the distance from state S to state Q to be the length of the shortest path Sw = Q. Note that these 
distances satisfy the triangle inequality but need not be symmetric or nonnegative. Then it is not hard to see 
that every straight path is a shortest path, so shortest path computation amounts to finding a straight path 
between the given states. Distances to a single state Q from every other state may be found in time 0[m) 
by following paths in the DAG of Lemma g, and single source shortest paths can be found by reversing the 
edges of this DAG. Somewhat less trivially, we can also speed up the time for all pairs shortest paths relative 
to the time for similar computations in graphs: 

Theorem 2. If we are given as input the adjacency list representation of a medium, then in time 0(n 2 ) we 
can build ann x n table that lists, for each two states S and Q, the distance from S to Q, as well as a token t 
such that St is effective and t belongs to a straight path from S to Q. 

Proof. We expand the representation of the medium to an explicit n x r table of transitions that allows us to 
test in constant time whether a token is effective for a given state. We then perform a depth first traversal of 
the states of the medium, maintaining as we do a data structure that allows us to compute the table entries 
for each traversed state Q. The depth first traversal itself makes at most 2n — 3 transitions before all states 
are reached, and this sequence of transitions can be constructed in time 0(m). The data structure that we 
maintain in the traversal consists of the following components: 

- A doubly-linked list L of pairs (f, A t ), where each t is a token in the content of Q and A t is a pointer to a 
linked list described below. 

- A pointer from each state S ^ Q to the first pair (t, A t ) in L for which St is effective. 

- Linked lists A t for each pair (?, A t ), listing the states pointing to that pair. 

The pointers from each state S to the associated pair (?, A t ) provide half of the information we are trying 
to compute: an effective transition on a straight path to Q from each other state S. We record this information 
for Q when the traversal first reaches Q. The other information, the numeric distances to Q, can easily be 
computed in time 0{n) for each Q by traversing the tree formed by the effective transitions St. 

We initialize the data structure by creating an empty list A t for each token in the content of the initial 
state Q, listing the pairs (t, A t ) in an arbitrary order, and sequentially searching this list to initialize the 
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pointer for each state 5 ^ Q. It remains to describe how to update the data structure as we perform each 
transition Qt = Q' of the depth first traversal. Token t belongs to the content of Q, so prior to the transition 
it is listed as part of some pair (1, Aj) in L. We remove this pair from L, and append a new pair (t, A t ) to 
the end of L, where A t is a new empty list. We must then recompute the pointers from each state 5 that had 
previously been pointing to (i, Aj); we do so by sequentially searching list L, starting at the position of the 
deleted pair (i, Aj), for the first pair (t 1 , A t i) such that St' is effective, and appending S to A t i. We set the 
pointer for Q to the pair (t, A t ) without searching, since t will be the only token in Q' that is effective for Q. 

We finish the proof by analyzing the time used by this algorithm. List L initially contains r/2 pairs, and 
each traversal step appends a new pair to L; therefore the total number of pairs added to L over the course of 
the algorithm is at most r/2 + 2n — 3 = 0(n). The most expensive part of the algorithm is the sequential 
searching to find a new effective pair. For each state S, the sequence of sequential search steps never revisits 
a position in L, so the total number of steps of sequential searching over the course of the algorithm is at 
most n(r/2 + 2n — 3) = 0(n 2 ). Computing numeric distances also takes a total of 0(n 2 ) time, and the 
other data structure update steps take only constant time per traversal step. Therefore, the total time for the 
algorithm is 0(n 2 ). □ 

From the table constructed above, one can construct a straight path Sw = Q for any two states S and Q, 
in time 0(|w|), by repeatedly using the table to find effective tokens in the path. 

Related to distance is the concept of complementation in a medium. We define the complement of a state 

5 to be a state Q such that S and Q have disjoint contents. If a complement exists, it is unique, and is the 
farthest state from S for any length function. We can test whether a medium has a complement for all of its 
states in 0(m) time, very simply, by searching for the complement Q of a single state S, and then performing 
two parallel traversals through the medium starting from these two states, maintaining as an invariant that 
the state visited by the second traversal is complementary to the state visited by the first traversal. This idea 
can be extended to a method for finding all complementary pairs, somewhat less efficiently but still faster 
than the all pairs shortest paths algorithm above. 

Theorem 3. In time 0(nr) we can find all complementary pairs of states in a medium. 

Proof. As in the previous algorithm, we expand the medium representation to one of size 0(nr) that allows 
for fast tests of the effectiveness of a transition as well as for determining whether or not a token belongs 
to the content of a given state. We then perform a depth first traversal of all states Q. At each step of the 
traversal we maintain the following data: 

- A state S 

- A list L of the tokens in Q n 5 

If L is empty, 5 must be the complement of Q. Otherwise, if Q has a complement S' ^ S, then L must contain 
each token on a straight path S'w = S, and is nonempty. To perform a step Qt = Q' of the traversal, we 
remove t from L (if it was there), and add t to L (if it belongs to the content of S). Then, as long as we can 
find a token t' € L with Si' effective, we replace S by Si' and remove t' from L. We search for t' by scanning 
L after each change of S. If we reach a state S with L empty, 5 is the complement of Q; otherwise, Q has 
no complement. We scan L once per change of Q, and once per change of 5. Since Q follows a depth first 
traversal, it changes 0(n) times. Each change of S is accompanied by the removal of an item from L, and 
items are added to L only when Q changes, so the number of changes to S is also bounded by 0[n). Thus, 
the total time is 0(nr). □ 

6 Closed Orientations 

Falmagne and Ovchinnikov [ pi] ] define a closed orientation of a medium to be one in which, whenever 
positive tokens t and t' are both effective for any state S, the messages tt' and t't are stepwise effective for 
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S. For instance, the natural orientation of an antimatroid is closed. Closed orientations have several useful 
properties: for instance, any state can be transformed to any other state by a step wise-effective canonical 
message consisting of a sequence of positive tokens followed by a sequence of negative tokens; additionally, 
like the content orientations used in the previous section, the DAG formed from a closed orientation has 
a unique sink. A naive algorithm for testing whether an orientation is closed (testing each triple S, t, t') 
would take time 0(nr 2 ). We improve this with the following simple observation, which can be viewed as a 
strengthening of Lemma |] showing that the number of effective tokens is small on a per-state basis instead 
of only when totalled. 

Lemma 6. Let T~} be a closed orientation of a medium. Then for any state S there are at most log 2 n 
positive tokens effective for S. 

Proof. Let E denote the set of positive tokens effective for S. Then it follows from the definition of a closed 
orientation that, for each E' C E, there is a distinct state S E / = Sw E i where w E i is formed by concatenating 
the tokens in E' . Thus the medium contains at least 2^ states, from which the result follows. □ 

Note however that the number of negative effective tokens may be as large as n — 1. 

Theorem 4. We can test whether an orientation of a medium is closed in time 0{m log n). 

Proof. In time 0{m) we can determine the set of effective positive tokens for each state. If any state has 
more than log 2 n tokens, we can immediately determine that the orientation is not closed. Otherwise, we test 
each triple 5, t, t' where t and t' are selected only from the positive tokens effective for 5. The number of 
pairs S, t involved in such triples is at most m, and each such pair forms at most log 2 n triples, so the number 
of tests is 0(m\og n). □ 

The method above is non-constructive, in the sense that it may determine that an oriented medium is 
not closed without finding an explicit triple 5, t, t' violating the definition of a closed medium. With some 
additional care we can make it constructive: 

Lemma 7. In any closed oriented medium, let p(S) denote the number of positive effective tokens for state 
S, and let t be a positive token. Then p(St) > p(S) — 1. 

Proof. By the definition of a closed medium, every positive effective token for 5 other than t itself must also 
be effective for St. □ 



Theorem 5. If an oriented medium is not closed, we can find a triple S, t, t' violating the definition of a 
closed medium in time 0(mlog n). 

Proof. We can compute p(S) for each state S in total time 0(m). If some state S and positive token t have 
p(St) < p(S) — 1, then we can compare the lists of effective tokens for S and St in time 0{t) and find a 
token t' that is effective for S but ineffective for St; S, t, and t' form the desired triple. 

Next, if p{St) > p(S) — 1 is always true, but some state has more than log 2 n positive effective tokens, 
we can follow a sequence of positive effective transitions from that state until we find a state S with exactly 
1+ [log 2 n\ positive effective tokens. Let w = t\ . . . ?£ be a message formed by concatenating all such tokens; 
we can perform a breadth first search to find a minimal subsequence w' of w such that Sw' is not stepwise 
effective. Such a subsequence must exist because (by the axioms of a medium) each stepwise effective 
subsequence of w transforms 5 to a distinct state, and there can be at most n states found altogether, but 
there are 2 k > n distinct subsequences of w. This search takes time 0(2 k ) = 0(n). Let w' = vtt' for some 
message v and tokens t and t'; then Sv, t, t' form the desired triple. Finally, if all states have at most log 2 n 
positive effective tokens, we can perform the same search for violating triples used in Theorem ||. □ 



7 



Next, we consider the problem of finding closed orientations for unoriented media. The simplest medium 
without a closed orientation is one with six states and six tokens, formed by the well-graded family of all 1- 
and 2-element subsets of a 3-element set. 

Theorem 6. If we are given an unoriented medium, we can find a closed orientation, if one exists, or deter- 
mine that no such orientation exists, in time O(mr). 

Proof. We reduce the problem to an instance of 2-SAT [[[]], by creating a Boolean variable for each token 
in the medium, where the truth of variable t in a truth assignment for our instance will correspond to the 
positivity of token t in an orientation of the medium. By adding clauses t V t and -if V -if, we guarantee 
that exactly one of the two variables t and t is true, so that the truth assignments of the instance correspond 
exactly to orientations of the medium. We then add further clauses -<t V -if' for each triple S, t, t' such that St 
and St' are effective but Stt' is not stepwise effective. In an orientation corresponding to a truth assignment 
satisfying these clauses, t and t' can not both be positive, so triple 5, t, t' will not violate the definition of a 
closed orientation. The 2-SAT instance can be constructed in time 0(mr), and solved in time 0(t 2 ). □ 

An alternative method for finding closed orientations is based on the observation that any orientation 
with a unique sink Q must be the content orientation of Q. We can test each content orientation using 
Theorem |] However this does not seem to lead to a method more efficient than the one described above. 

7 Black Box Media 

A medium may have exponentially many states, relative to its number of tokens, making storage of all states 
and transitions infeasible. However, for many media, a single state can easily be stored, and state transitions 
can be constructed algorithmically. Thus, we would like to design algorithms for media that do not depend 
on storing an explicit list of states and state transitions. However, we would like to do this in as general way 
as possible, not depending on details of the state representation. This motivates an oracle-based model of 
media, similar to the black box groups from computational group theory [Bl. 

We define a black box medium to be a representation of a medium consisting of the following parts: 

- A list of the tokens of the medium. 

- A procedure transitionFunction that takes as input the representation of a state S and a token t, 
and produces as output the representation of the state St. 

- The representation of a single state Sq. 

We require each state to have a unique representation, so that states can be compared for equality, however 
the details of the state representation and transition function are unavailable. 

For example, all of the well-graded set family examples discussed in the introduction could be imple- 
mented by using a bitmap to represent a set, with the transition function calling another function to determine 
whether a given set belongs to the family, but this type of independence oracle allows more computations 
than we consider here [|l3|,[l4]]. The hyperplane arrangement example could be implemented by using linear 
programming to determine whether a token produces an effective transition. 

As before, we use r to denote the number of tokens of a black box medium, and n to denote the num- 
ber of states. Since any medium has a finite number of states, we can bound the amount of memory re- 
quired to store a single state by a parameter s. We denote by T the amount of time required per call to 
transitionFunction. In this context, we consider "polynomial time" to mean time that is polynomial 
in r and T, and "polynomial space" to mean a space bound that is polynomial in r and s. 

One of the most basic computational problems for a black box medium is to find all of its possible states. 
As with any deterministic finite automaton, one could list all states by performing a depth first traversal of 
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the state transition graph; however, if the medium is large enough to make an adjacency list representation 
infeasibly large, then it would also likely be infeasible to store the set of already-visited nodes needed to 
allow depth first search to avoid repetition of states. As we now show, it is possible instead to traverse all 
states of a black box medium, using an amount of storage limited to only a constant number of states. 

The main idea behind our traversal algorithm is to use a modified version of the reverse search procedure 
of Avis and Fukuda [fj]. The reverse search procedure allows one to list all the states of a general state 
transition system, as long as one can define canonical paths: sequences of transitions from each state to an 
initially given state Sq such that the union of the transitions on all the paths forms a spanning tree of the state 
space, the canonical path tree. If one can quickly determine whether a transition belongs to the canonical 
path tree, then one can list the states by traversing this tree, ignoring the transitions that do not form edges in 
the tree. In our case, canonical paths can be constructed from the content orientation of the initial state So- If 
we have an ordered list of the positive tokens of the content orientation, then a step in a canonical path from 
state S can be found by taking the first token in the list that is effective on S. The difficulty, however, is that 
we are not given the content orientation, and can not construct it without searching the medium, the task we 
are trying to solve. Fortunately, we can interleave the construction of the orientation and the enumeration 
of states, in a way that allows the reverse search algorithm to proceed, taking polynomial time per state and 
using polynomial space. 

Theorem 7. If we are given a black box medium, we can list all states of the medium in time 0(nr 2 T) and 
space 0(s + r). 

Proof. We perform the reverse search procedure described above, building a list of positive tokens in the 
content orientation of So and simultaneously searching the tree of canonical paths, where the canonical path 
from a state to So is found by applying the first effective token stored in the list being built. The data stored 
by the algorithm consists of the current state S in the search, a pointer to a token t in the list of all tokens, 
and the list of positive tokens; we maintain as an invariant that t is the first token for which we have not yet 
searched transition St. We also maintain invariant that, whenever our search reaches a state S, the tokens on 
the canonical path from S to So have already been included in our list of positive tokens. Initially, the list of 
positive tokens is empty, S = Sq, and t is the first token in the list of tokens. 

At each step of the algorithm, we test transition St as follows: if t is already listed in the list of positive 
tokens, it can not be the reverse of a step in a canonical path, and we advance t in the list of all tokens. 
Similarly, if St is ineffective we advance t. If t is effective, then we search the list of tokens for the reverse 
token t satisfying Sti = S. The path formed by composing the transition from St back to S with the canonical 
path from S to So must be straight (otherwise it would include t and t would have been already included in 
the list of positive tokens) so t must be positive and we include it in the list of positive tokens. We then check 
whether t is the first listed positive token that is effective for St. If so, then the transition from S to St is the 
reverse of a canonical step, so we set S to St, reset t to the beginning of the token list, output state St, and 
continue searching at the new state. If not, we advance t and continue searching from t. 

Whenever the token t advances past the end of the list of tokens in our search, this indicates that we 
have exhausted all transitions from the state S. So, we return to the parent of S in the canonical path tree, 
by searching the list of positive tokens for the first one that is effective on S. We must then also reset the 
token t to the appropriate position in the list of tokens; we do this by searching sequentially through the list 
of tokens for the one that caused the parent of S to transition to S. When we advance past the end of the list 
of tokens for state So, the search is complete. 

It is not hard to see that each step maintains the invariants discussed above. One can prove by induction 
on the lengths of canonical paths that each state is output exactly once, when it is found by the reverse of a 
canonical step from its parent in the canonical path tree. The space bound is easy to compute from the set of 
data stored by the algorithm. For each effective transition t from each state S, we perform 0(t) calls to the 
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def mediumStates (sO, tokens, transitionFunction) : 
'''List all states of a black box medium. 

The input is a single starting state sO, a list of tokens, and a procedure that 
performs the transition function (state, token) ->state of a medium. 
The output is a simple generator yielding all states of the medium.''' 

positiveTokens = { } 

# dictionary of tokens that can occur on a straight path to sO. 

# since we need to store something in positiveTokens [t ] , 

# we use it to store the inverse token of t, but this inverse 

# is not actually used by the search algorithm. 

def step (state) : 

'''Return a token that would take the given state on a straight path to sO.''' 
for token in positiveTokens: 

neighbor = transitionFunction (state, token) 

if neighbor != state: return neighbor 
raise ValueError ( "bad medium: unable to find a positive step from " + repr (state)) 

state = sO 
yield state 

tokenSequence = iter (tokens) 
while 1 : 
try : 

token = tokenSequence . next ( ) 
if token in positiveTokens: continue 
neighbor = transitionFunction (state, token) 
if neighbor == state: continue 

# here with an effective nonpositive token. 

# make sure its inverse is listed as positive so we can call step() 
for invtok in tokens: 

if transitionFunction (neighbor, invtok) == state: 
positiveTokens [ invtok ] = token 

# check whether state->neighbor is the reverse of a step 
if step (neighbor ) == state: 

state = neighbor 
yield state 

tokenSequence = iter (tokens) 

except Stoplteration : 

# here after exhausting all tokens for the current state, 
if state == sO: 

return 

# we need to backtrack in the search. 

# the parent state can be found with step() but we also need 

# to find where the parent's search left off in the token sequence, 
parent = step (state) 

tokenSequence = iter (tokens) 
for token in tokenSequence: 

if transitionFunction (parent, token) == state: 

break # found correct point in token sequence 
state = parent 

Fig. 1. Python implementation of reverse search for black box media states. 
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transition function: one scan of the token list to find the reverse t and determine whether St is canonical, a 
second scan of the token list to find the effective transitions from St (if St was canonical), and a third scan of 
the token list to find the correct position of t after returning from St to S. Therefore, the total number of calls 
to the transition function is 0(t 2 ) per state, which dominates the total running time of the algorithm. □ 

We remark that this algorithm can easily be adapted to produce a reset sequence of length n — 1 in the 
same time and space bounds: simply omit the output of each generated state, and instead output token t 
whenever the search returns from a state S to its parent St. 

Figure [j] displays an implementation of our reverse search procedure in the Python programming lan- 



guage (EG]. The yield keyword triggers Python's simple generator protocol [22], which creates an iterator 



object suitable for use in f or-loops and similar contexts and returns it from each call to mediumStates. 



8 Conclusions 



We have discussed several algorithmic problems on media, but it seems likely that some of our algorithms 
are non-optimal and that other interesting problems remain to be solved. 

As one such problem, is it possible to efficiently find the shortest reset sequence for a medium? Finding 
shortest reset sequences for general DFAs is NP-complete ^ but the reduction used to prove this does not 
seem to apply to media. 

Another interesting direction for further research concerns random walks on media, as studied by Fal- 
magne [|l(J and applied by Regenwetter et al. to electoral behavior [|l9|]. In studies of Markov systems, one 
of the most important parameters is the mixing time, which controls the speed at which the system reaches its 
stable distribution from an initial nonrandom configuration. It is easy to construct media with mixing times 
that are exponential in the number of tokens: for instance, form a well-graded set family from the union of 
the powersets of two equal-cardinality disjoint sets; then these powersets have exponentially many members 
but intersect only in the empty set. The same medium (with a random starting state) also shows that it is 
impossible to quickly approximate a uniform sample of a black-box medium, or to quickly list all minimal 
states for the content orientation of a black-box medium. However, in many natural examples of media, the 
mixing time seems to be small. Is there a media-theoretic explanation for this phenomenon? 
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